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Controlling Proteolysis 
through Compartmentalization 

Protein degradation is a necessity for many reasons: 
Homeostasis must be maintained while cellular struc- 
tures are continually rebuilt, in particular during develop- 
ment or in response to external stimuli. Proteins mis- 
folded as a consequence of mutations or ensuing from 
heat or oxidative stress must be scavenged because 
they are prone to aggregation. Beyond these more mun- 
dane "housekeeping" functions, protein degradation 
provides a means to terminate the lifespan of many 
regulatory proteins at distinct times; amongst them are 
cyclins, transcription factors, and components of signal 
transduction pathways (for reviews, see Coux et al. ( 
1996; Hiit and Wolf. 1996; Varshavsky. 1997). Moreover, 
the immune system relies on the availability of immuno- 
competent peptides generated by the degradation of 
foreign antigens (for reviews, see Goldberg et al., 1995; 
Heemels and Ploegh. 1995). 

However, since protein degradation is also a hazard, 
it must be subject to spatial and temporal control in 
order to prevent the destruction of proteins not destined 
for degradation. A basic stratagem in controlling protein 
degradation is compartmentalization, that is, the con- 
finement of the proteolytic action to sites that can only 
be accessed by proteins displaying some sort of degra- 
dation signal. Such a compartment can be an organelle 
delimited by a membrane, as in the case of the lysosome. 
Proteins to be degraded must be imported into the lyso- 
some via specific pathways, and the hydrolases carrying 
out the task must also be sorted from other proteins 
and are translocated by means of transport vesicles. 

Prokaryotic cells, possessing neither membrane-bound 
compartments nor vesicular transport systems, have 
developed a different form of compartmentalization, 
namely self- or autocompartmentalization (Lupas et al. ( 
1 997b). This principle is seen at work in several unrelated 
proteases that have all converged toward a common 
architecture in which proteolytic subunits self-assemble 
to form barrel-shaped complexes. These enclose inner 
cavities, which are several nanometers in diameter and 
harbor the active sites. Access to these inner compart- 
ments is usually restricted to unfolded polypeptides, 
which can pass through the narrow pores or channels 
guarding the entrance. The target proteins thus require 
interaction with a machinery capable of binding and 
presenting them in an unfolded form to the proteolytic 
core complexes; these interactions may be of either a 
transient or a continuous nature. Since protein folding 
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and unfolding are closely related mechanistically, it is 
assumed, but not proven, that this task is performed by 
ATPase complexes, which bear some resemblance to 
the chaperonins and have been referred to as "reverse 
chaperones" or "unfoldases" (Lupas et al., 1993). Since 
their action requires the hydrolysis of ATP, protein deg- 
radation becomes energy-dependent, although the hy- 
drolysis of the polypeptide chain itself is an exergonic 
process. 

Self -compartmentalizing proteases are common in all 
three domains of life: archaea, bacteria, and eukarya. 
This bears testimony to an old evolutionary principle. In 
fact, contrary to organelles such as the lysosome, self- 
compartmentalizing molecular devices offer far greater 
flexibility: when equipped with the appropriate localiza- 
tion signals, they can be deployed to different cellular 
locations in the cytosol or in the nucleus, wherever their 
action is needed. The advances made in recent years 
in understanding the structure of the proteasome and its 
mechanism of action has helped to shape the concept of 
self-compartmentalization, and the proteasome became 
the paradigm of this form of regulation. 

The 20S Proteasome: Core of the 
Proteolytic Machinery 
20S Proteasomes Are Found in All Three 
Domains of Life 

The first description of a "cylinder-shaped" complex 
with proteasome-like features dates back to the late 
sixties. The plethora of names given to it subsequently 
is a reflection of the problems that were encountered 
over a period of two decades in trying to define its 
biochemical properties and cellular functions. Enzymo- 
logical studies revealed an array of distinct proteolytic 
activities and led to a consensus name, "multicatalytic 
proteinase" (Dahlmann et al.. 1988). This name, how- 
ever, was soon replaced by a new one, the proteasome 
(Arrigo et al., 1988), emphasizing its character as a mo- 
lecular machine (for a brief account of the early history 
of this field, see Baumeister et al., 1997). 

At about the same time, it was found that the occur- 
rence of proteasomes was not restricted to eukaryotic 
cells. A compositionally simpler, but structurally strik- 
ingly similar proteolytic complex was found in the arch- 
aeon Thermopiasma acidophifum (Dahlmann et al., 
1989), which later took a pivotal role in elucidating the 
structure and enzymatic mechanism of the proteasome. 
Meanwhile, proteasomes have been identified in several 
archaea (e.g., see Maupin-Furlow and Ferry, 1995), and 
the forthcoming sequence data of more genomes will 
reveal how common they are within this domain. 

The first evidence for the existence of proteasomes 
in bacteria was provided by a database search and 
sequence comparison (Lupas et al.. 1994), which indi- 
cated that two types of proteasomes might exist in bac- 
teria, one represented by the Escherichia cofi hsIVgene 
(Chuang et al., 1993) and one by a gene (prcfi) found 
in the Mycobacterium leprae genome. Subsequent bio- 
chemical and structural studies confirmed the existence 
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of two types of proteasomes in bacteria: HsWarid^ its% 
homology a (though clearly related to (p-type) proteasd- 
fnai subunits, form a simpler complex in which two six- 
nriemb^ 

directly with ah^ATPase of the Clp family (€lpX/HslU) 
(Bochtler et al., 1997; Rohrwild et at., 1997). The second 
type, represented- by Mycdbacierium and' Rhodocdc- 
casjs possibly restricted m its occurrence to actinomy- 
cetales. These proteasomes form a complex indistin- 
guishable in its general architecture from eukaryotic^or 
archaeal 20S proteasomes (Tamura et al, 1995; Zuhl et 
al;i 1997a)r It is conceivable that the actinomycetales 
have acquired the proteasome genes by horizontal gene 
transfer after their separation from other Gram-positive 
bacteria, possibly from a eukaryotic organism, espe- 
cially considering that many actinomycetales live in 
symbiotic association with eukaryotes or are pathogens 
(Lupas et at, 1997a). Under normal growth conditions, 
proteasomes or proteasome-related complexes appear 
not to be essential in eubacteria. M. smegmatis cells in 
which proteasome genes are deleted are viable and 
phenotypically indistinguishable from wild-type cells 
(Knipfer and Shrader, 1997). This is different from the 
situation in yeast, where the disruption of 13 out of the 
14 proteasome genes is lethal (e.g.. see Hilt and Wolf, 
1 995), but is consistent with the observation that protea- 
some-related genes were not found in some recently 
sequenced eubacteria! genomes (e.g., Mycoplasma 



<The 20S Cotv Structure 
MArchaebacteria to Eukaryotes 
Archaeal proteasomes are buflt of 14 copies each of two 
different but related subunits, a and p, whereas eukaryo- 
tic proteasomes contain two copies each of 14 different 
subunits, which, based on their sequence similarity, can 
be divided into an a-type and a P-type group. The sub- 
units are arranged into four seven-membered rings, with 
the a-type subunits forming the two outer rings, and the 
P-type subunits the two inner rings. Collectively, they 
form a barrel-shaped complex, 15 nm in length and 11 
nm in diameter (Figure 1 a), which encloses three internal 
cavities, approximately 5 nm in diameter bounded by 
four narrow constrictions. The central cavity is formed 
by the two adjacent p rings, while the two outer cavities 
(the "antechambers") are formed jointly by one a and 
one 0 ring. 

As anticipated from their sequence similarity, the non- 
catalytic a- and the catalytic p-type subunits have the 
same fold (Lowe et al., 1995; Groll et al.. 1997): a four- 
layer a and p structure with a central five-stranded p 
sandwich flanked on either side by a helices (Figure 1a). 
Helices 1 and 2 mediate the interaction of the a- and p 
rings (p-£rans-a) by intercalating in a wedge-like fashion. 
Helices 3 and 4, which are located on the opposite side, 
provide the dominant contacts between the two p rings 
(p-trans-p). While this general architecture is the same 
in the Thermoplasma and in the yeast proteasome, there 
is a large number of additional specific a-c/s, 0-c/s, 
0-frans-a, and p-frans-p contacts in the yeast protea- 
some as detailed in Groll etal., 1997. Together with the 
propeptides, these contacts seem to ensure that the 
assembly proceeds in an orderly fashion, that is. that 



each of the 14 different subunits takes its correct place 
(see below). 

The main difference between a- and p-type subunits 
is due to a highly conserved N-terminal extension of the 
a-type subunits, part of which (residues 20-30) forms 
an a helix (HO) across the top of the central p sandwich 
(see Figure 1 a). The function of the N-terminal extension 
is not clear, but its location at the top of the a rings 
close to the entrance to the antechambers indicates that 
it may be important for the translocation of substrate or 
for interactions between the proteasome and its regula- 
tory complexes. Instead of this N-terminal extension, 
p-type subunits have prosequences of varying lengths, 
which are removed during proteasome assembly, thus 
rendering the cleft between the central p sandwich freely 
accessible from the central cavity. Although the protea- 
some fold was initially believed unique, it has subse- 
quently been shown to be prototypical of a new family of 
proteins referred to as the IMtn (N-terminal nucleophile) 
hydrolases (Brannigan et al., 1995). 
The Proteasome: A Threonine Protease 
A common feature of the Ntn-hydrolases is a "single 
residue" catalytic center, which is freed by the autocata- 
lytic removal of the prosequence. The identification of 
the N -terminal threonine of the p subunit of the Ther- 
moplasma proteasome as both the catalytic nucleophile 
and the primary proton acceptor came as a surprise 
(Lowe et al., 1995; Seemuller et al., 1995), since, until 
then, the proteasome was widely assumed to be an 
unusual type of serine protease. Beside the N-terminal 
threonine, p subunits require several other residues 
(Glu-17, Lys-33, Asp-166) for activity, although their ex- 
act roles remain to be clarified. Lys-33 and Glu-17 form 
a salt bridge across the bottom of the active site and 
may participate in the delocalization of the threonine 
side-chain proton by forming a charge relay system. 
Further evidence supporting the role of the N-terminal 
threonine came from work with the antibiotic lactacystin, 
or more accurately, its active form clasto-lactacystin 
p-lactone (Dick et al., 1996a), which covalently reacts 
with the Thr-1 of a specific subset of p-type subunits 
of mammalian proteasomes (Fenteany et al., 1995). The 
lactacystin experiments as well as recent mutational 
studies with yeast and mammalian p-type subunits 
(Chen and Hochstrasser, 1996; Schmidtke et al.. 1996; 
Arendt and Hochstrasser, 1997) confirmed a prediction 
made on the basis of sequence comparisons that the 
proteolytic mechanism is the same in eukaryotic and in 
Thermoplasma proteasomes (Seemuller et al., 1995). 
From the conservation pattern of the active site resi- 
dues, it was further deduced that, of the seven p-type 
subunits in an individual eukaryotic proteasome. only 
three are proteolytically active (see Figure 2a). Since 
each of them is present in two copies, the number of 
active sites in the central cavity is 6, instead of 14 as in 
the Thermoplasma proteasome. The reasons for this 
reduction in the number of active sites are unclear, and 
also, the function of the "inactive" P-type subunits 
awaits further clarification. Mutagenesis and chemical 
modification (e.g.. see Heinemeyer et al., 1993; Enenkel 
et al.. 1 994; Arendt and Hochstrasser, 1 997) have shown 
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Figure 1. Structure of the Thermopiasma 20S Proteasome 

(a) Low resolution model (~1.2 nm) derived from the atomic coordinates of the Thermopiasma proteasome (Lowe et aL, 1995). The a subunits 
form the heptameric outer rings; the £ subunits, the inner rings. This general architecture is conserved from Thermopiasma to higher eukaryotes 
(left). Individual a and £ subunits are shown as ribbon drawings (right). 

(b) Ribbon drawing of the 20S proteasome indicating one subunit in each of the four rings by different color coding (left). A half-proteasome 
(one o and one p ring) viewed down the 7-foW axis. The Tyr-126 residues in the loops between helix 2 (H2) and 3-sheet 5 (SB), which line the 
polypeptide channel giving access to the interior, are marked by small white spheres (right). The scale bar is 10 nm. 



that each of the three major activities of eukaryotic pro- 
teasomes (chymotrypsin-like activity, trypsin-like activ- 
ity, and peptidylglutamyl peptide hydrolyzing activity), 
as defined by the degradation of short fluorgenic pep- 
tides, can be abolished by modifying either an active or 
an inactive subunit. It is not clear whether special pairs 
of active and inactive subunits exist, which interact allo- 
sterically, or whether mutations of inactive subunits sim- 
ply tend to cause structural perturbations with a high 
likelihood of affecting the neighboring active subunits 
(see Figure 2b). 

Access to the Central Cavity Is Restricted 
to Unfolded Proteins 

Another feature apparently distinguishing the yeast pro- 
teasome and the Thermopiasma proteasome structure 
relates to the openings that give access to the inner 
cavities. The crystal structure of the Thermopiasma pro- 
teasome revealed the existence of a channel located 



along the 7-fold axis (Figure lb). This channel is defined 
by a turn-forming segment with Tyr-126 at a strategic 
position; collectively, the seven turns form a hydropho- 
bic annulus well suited for the translocation of an un- 
folded polypeptide. Direct evidence that this channel is 
the port of entry was obtained by electron microscopic 
visualization of a Nanogold-labeled insulin 0 chain caught 
in transit; the relatively bulky (approximately 2 nm) gold 
label prevents it from slipping across the channel into 
the antechamber (Wenzel and Baumeister, 1995). In the 
yeast proteasome, this channel is occluded by the 
N-terminal residues of the a subunits, which are disor- 
dered in the Thermopiasma proteasome. On the other 
hand, the yeast proteasome has small openings situated 
at the interface between a and 0 rings; this led to specu- 
lations that the side windows may be used to take up 
substrate. However, it is not very likely that such highly 
conserved structures use radically different routes for 
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Figure 2. Relatedness and Topology of Subunits of Eukaryotic Proteasomes 

(a) The dendrogram shows the classification of subunits into a and p types, and the division of p subunits into an "active" (closed boxes) and 
an "inactive" (open boxes) branch (for details, see Lupas et al.. 1994). The terms active and inactive refer to the presence or absence of Thr-1 
as the N-terminal nucleophile. For nomenclature, see <c). Ta-o and Ta-p represent Thermoplasma acidophilum a and p subunits, respectively. 
Oa) Localization of the 28 subunits within the eukaryotic proteasome. The diagram represents an unrolled cylinder of the 20S proteasome 
showing the positions of a and £ subunits as determined by the crystal structure of the yeast proteasome (Groll et al., 1997). A mapping of 
the subunits of the human proteasome based on immunoelectron microscopy revealed a remarkably similar topology (Kopp et al. 1997). The 
color coding denotes pairs of active (closed boxes) and inactive (open boxes) subunits responsible for specific activities as deduced from 
mutagenesis experiments, Subunits forming such pairs are in direct contact to each other, either in c/s (i.e.. within one ring) or in trans (i.e.. 
between two rings) as indicated by arrows. The prime symbol distinguishes equivalent subunits within the two a and the two p rings, 
(c) Nomenclature of 20S proteasome subunits. The new systematic nomenclature is based on the location of the yeast subunits within the 
seven-membered rings relative to the C2 symmetry axis. The exchangeable subunits. which are present only in the immuno proteasomes of 
higher vertebrates, are marked with V. 



translocation. Moreover, the side windows would be 
rather ill-positioned for the uptake of polypeptides when 
the 20S proteasome associates with its regulatory com- 
plexes. It is more likely, therefore, that eukaryotic pro- 
teasomes use the central channel, which then must un- 
dergo a conformational change. It is noteworthy in this 
context that eukaryotic but not Thermoplasma protea- 
somes are purified in a latent state and need to be 
activated by chaotropic agents or heat. These treat- 
ments may selectively unfold the N-terminal sequence, 
resulting in an opening of the channel. Under physiologi- 
cal conditions, a gating of the channel could be con- 
trolled by regulatory complexes that dock to the two 
termini of the 20S complex. While the side windows are 
unlikely to be used for the entry of substrate, it is possi- 
ble that they have a role in discharging degradation 
products. 



Polypeptides to be degraded must wind their way 
through a system of internal cavities and constrictions 
until they reach the active sites in the central cavity, at 
least 8-10 nm away from the orifice at the center of 
the a ring (see Figure 3). The underlying mechanism of 
translocation is unknown, as is the precise role of the 
two antechambers. One could envisage that when the 
20S proteasome is associated with regulatory com- 
plexes which unfold substrate proteins in an ATP- 
dependent manner, unfolding and translocation of the 
polypeptide are coupled, and thus, that the unfolded 
chain is "pushed" into the 20S core. However, when 
the unfolded polypeptide alone is offered to the 20S 
proteasome in vitro, it is capable of degrading it; thus, 
the translocation is not strictly energy-dependent. The 
structural properties of the interior of the proteasome 
should bias the random walk of the polypeptide chain 
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Figure 3. Hydrophobicity of Inner and Outer Surface Regions of the 
Ttiermopfasma 20S Proteasome 

The hydrophobicity of the amino acids was taken from Pickett and 
Sternberg, 1993. and mapped to the molecular surface. The left side 
shows a cut-open model of the complex with the antechambers 
(AC) formed jointly by an a and p ring, and the central cavity (CC) 
formed by two 0 rings. The cutting surface is shown in dark gray. 
On the right side, the outer surface of one a ring is shown. The 
scale bar shows the color coding of the hydrophobicity in kcat/mol. 

toward the active side clefts; the antechambers, which 
have a volume of ~59 nm J ( must be able to maintain 
the polypeptide in an unfolded form as it passes through 
them. It is possible that the existence of constrictions 
segregating the front end from the rear end of a translo- 
cating polypeptide chain serves as a means to prevent 
refolding. The crystal structure of the chaperonin GroEL 
in complex with its co-chaperonin GroES (Xu et al., 1 997; 
for review, see Bukau and Horwich, 1998, this issue of 
Celt) illustrates how properties of cavities may determine 
interactions with normative polypeptides. Upon binding 
of GroES, one of the two heptameric GroEL rings (the 
cis ring) undergoes a major structural rearrangement 
and, concomitantly, the properties of the cis cavity 
change: while the trans cavity is lined with hydrophobic 
residues, favoring the interaction with nonnative poly- 
peptides, the cis cavity exposes mostly polar residues 
and thus repels nonnative polypeptides. In terms of 
overall hydrophobicity, the antechambers of the protea- 
some assume an intermediate position. Upon careful 
inspection, one may discern an array of paths where 
hydrophobic residues are clustered (Figure 3); in spite 
of their meandering appearance, these paths seem to 
connect the a-ring channel with the inner constriction 
and thus may give direction to a polypeptide chain on 
its way into the central cavity. 

The central cavity, which is less hydrophobic than the 
antechambers, has a volume of ~84 nm 3 , allowing it, in 
principle, to accommodate a single folded protein of 
~70 kDa; a loosely packed unfolded polypeptide re- 
quires much more space. Since polypeptides can only 
enter the cavity one after the other, the central cavity will 
not usually accommodate more than one polypeptide at 
a time. The confinement of the substrate to this cavity 
with its 6-14 active sites provides the structural basis 
for the processive mode of action of the proteasome; it 
completes the degradation of one polypeptide before 
attacking the next (Akopian et al., 1997). 
Characteristics of the Degradation Products 
Eukaryotic proteasomes have three major peptidase ac- 
tivities that have been defined using short fluorogenic 



peptide substrates: a chymotrypsin-like activity, which 
cleaves after hydrophobic residues, a trypsin-like activ- 
ity, which cleaves after basic residues, and a peptidyl- 
gfutamyl peptide-hydrolyzing activity, which cleaves 
after acidic residues. Two additional specificities have 
been identified in mammalian proteasomes, cleaving 
after branched chain residues and between small neutral 
amino acids (for review, see Cardozo, 1993). The protea- 
somes of Thermoplasma and of the bacterium Rhodo- 
coccus have only chymotrypsin-like activity, consistent 
with the fact that they have only one type of active site 
(Dahlmann et al.. 1992; Tamura et al., 1995). Mutational 
studies and the crystal structure of the yeast protea- 
some can be used to assign these activities to distinct 
subunits (see Figure 2), but they have little (if any) rele- 
vance to the cleavage specificity of protein substrates. 
Degradation studies performed with Thermoplasma 
proteasomes (Wenzel et al., 1994) and eukaryotic pro- 
teasomes (e.g., see Dick et al. 1994; Ehring et al., 1996) 
have yielded cleavage patterns that do not correlate 
with the aforementioned specificities. Obviously, a clas- 
sification of cleavage specificities using the residue di- 
rectly adjacent to the cleavage site (the P1 position) falls 
short of reality. 

It was intriguing to observe that, in spite of cleaving 
protein substrates in a rather nonspecific manner, the 
generated products fell into a relatively narrow size 
range, averaging around 7-9 residues. This property, 
which is common to prokaryotic and eukaryotic protea- 
somes. led to the proposal that an intrinsic "molecular 
ruler" determines product length (Wenzel and Baumeis- 
ter. 1995). It was envisaged that the distance between 
active sites, acting in concert, could provide the physical 
basis of such a ruler. The crystal structure of the Ther- 
moplasma proteasome revealed a distance of 2.8 nm 
between neighboring active sites corresponding to a 
hepta- or octa-peptide in an extended conformation; 
thus, it seemed to provide strong evidence in support 
of the molecular ruler hypothesis. On the other hand, 
recent more quantitative analyses of product lengths 
(Kisselev et al.. 1998). while in agreement with the aver- 
age length, showed larger size variations, which may be 
difficult to reconcile with a purely geometry-based ruler. 

It is now well established that the proteasome has an 
important role in generating immunocompetent pep- 
tides to be displayed by the MHC class I complex. Obvi- 
ously, the evolution of the proteasome predates the 
evolution of the immune system, and the availability of 
peptides between 7 and 9 residues long must have had 
a profound influence on the evolution of the MHC class 
I system (Niedermann et al., 1997). In turn, the protea- 
some seems to have responded to the need of the immune 
system for specific peptides by developing variants of 
some of its p-type subunits, which upon induction by 
7-interferon can replace their constitutive counterparts 
in the 20S complex, thus allowing further modulation 
of specificity (for reviews, see Goldberg et al., 1995; 
Heemels and Ploegh, 1995). 
Proteasomes Carry Cellular Localization Signals 
Proteasomes are located both in the nucleus and in the 
cytoplasm of eukaryotic cells, and they undergo a cell 
cycle-dependent redistribution (Amsterdam et al., 1993; 
Palmer et al., 1 994). Proteasomes indeed possess multi- 
ple nuclear localization signals (NLS sequences) that 
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are present on two (yeast) or four (human) a-type sub- 
units. They are located on either one of two surface- 
exposed loops that are situated on the outer periphery 
of the a rings in the assembled complex. The functional- 
ity of these sequences has been demonstrated by 
means of reporter molecules to which peptides with the 
respective NLS sequences were coupled (Nederiof et 
al. r 1995) as well as by mutational studies (Wang et al., 
1 997a). Signal-mediated changes in the cellular location 
of the proteasome add a layer of complexity to the spa- 
tial and temporal regulation of proteolysis. 
The Propeptide Has Important Roles in the 
Assembly Pathway 

The assembly of the 20S proteasome is intimately cou- 
pled to the processing of the p-type subunits, which 
renders them active. Posttranslational removal of a pro- 
peptide is a common mechanism for controlling the ac- 
tivity of proteolytic enzymes and perhaps best studied 
for extracellular bacterial proteases such as the a-!ytic 
protease. Propeptides may have functions beyond pre- 
venting premature activation; they can promote folding, 
acting as intramolecular chaperones, or serve as signals 
that determine the cellular localization of the enzyme 
(for review, see Baker et al. r 1993). In the proteasome, 
the proregion is removed at a late assembly stage, thus 
deferring activation until the p subunits are confined to 
the inner cavity of the complex. The cleavage reaction 
proceeds via an autocatalytic mechanism relying on the 
active site threonine (Thr-1). The invariant glycine pre- 
ceding it (Gly-1) is the prime (if not the only) determinant 
of the cleavage site (Chen and Hochstrasser. 1996; 
Schmidtke et al., 1996; SeemOiler et ah. 1996). It is as 
yet unclear whether the autocatalytic reaction is intra- 
molecular (c/s) or intermolecular {trans) or whether both 
mechanisms apply. In the Thermoplasma proteasome. 
coexpression of mutant, inactive p subunits and wild- 
type p subunits leads to efficient processing of the inac- 
tive subunits, which demonstrates that intermolecular 
processing is possible (SeemOiler et al.. 1996). For eu- 
karyotic proteasomes, where several subunits have pro- 
peptides much longer than the eight residues found in 
Thermoplasma, a two-step model has been proposed: 
in the first step, the propeptide is trimmed in size by an 
intermolecular mechanism; and in the second step, the 
residual propeptide of 8-1 0 residues is then removed via 
an intramolecular mechanism (Schmidtke et al., 1996). 

Archaeal, bacterial, and eukaryotic proteasomes do 
not seem to follow exactly the same assembly pathways. 
This reflects the increase in complexity and the corre- 
sponding need in eukaryotes to orchestrate the orderly 
assembly of 14 different subunits. The sequence of 
events that lead from individual subunits to mature pro- 
teasomes are best understood for the Thermoplasma 
proteasome (Zwickl etal., 1994; SeemOiler et al., 1996). 
Here, the a subunits assemble spontaneously into 
seven-membered rings in the absence of p subunits. In 
contrast, the p subunits alone are unable to assemble, 
remain unprocessed, and do not even fold completely. 
Thus, it appears that the a rings serve as a template 
upon which the p subunits assemble. Interestingly, at 
least one of the seven human a-type subunits forms 
ring structures resembling those of Thermoplasma a 
subunits (Gerards et al., 1997). Although it remains un- 
proven that a rings are an assembly intermediate in vivo. 




Figure 4. Assembly Pathway of the Rhodococcus Proteasome 
The first step in the assembly pathway is the formation of oJ$ hetero- 
dimers (!) followed by the assembly of half proteasomes 00- Half 
proteasomes dimerize to form preholoproteasomes (III), which are 
converted into holoproteasomes (IV) through the autocatalytic re- 
moval of the propeptide. 

a recent study analyzing precursor complexes of mouse 
proteasomes with subunit-specific antibodies lends 
support to such a role (Nandi et al., 1997). These authors 
propose that assembly is initiated by the formation of 
a ring containing all seven a-type subunits and then 
continues with the sequential addition of the seven 
p-type subunits. Recent experiments with the Rhodo- 
coccus proteasome suggest a variant pathway: while 
recombinant a- and p subunits assemble efficiently in 
vivo and in vitro, neither of the two components alone 
yields any detectable assembly product; hence, one 
must conclude that an a/p heterodimer is an early inter- 
mediate in the pathway (Zuhl et al., 1997b). 

Proteasome propeptides are not only highly variable 
in length and sequence, but also in their importance for 
the assembly process. While deletion of the 65 residue 
propeptide of Rhodococcus^ subunits results in a dras- 
tically reduced yield of mature proteasomes (Zohl et 
al., 1997b), deletion of the 8 residue propeptide of the 
Thermoplasma p subunrt has very little effect (Zwickl 
et al., 1994). Not surprisingly, therefore, Thermoplasma 
proteasomes can be reconstituted efficiently in vitro fol- 
lowing complete dissociation or even unfolding (Grziwa 
et al., 1994). Dictyostelium proteasomes cannot be re- 
constituted in vitro, indicating that, once processed, the 
subunits have lost the ability to find their correct place 
in the complex (Schauer et al., 1993). In yeast, expres- 
sion of the p (Doa3) subunit without its prosequence 
also results in a failure to incorporate this subunit into 
the complex (Chen and Hochstrasser, 1996). Since Rho- 
dococcus proteasomes, which are composed of four 
subunits (al, a2, p1, and p2). assemble very efficiently 
in vitro with any combination of a and p, this system 
offers a means to dissect the assembly pathway (Zuhl et 
al., 1997b). Separately, Rhodococcus a and p subunits 
remain monomelic and inactive. When the subunits are 
allowed to interact, active proteasomes form spontane- 
ously. Therefore, the earliest intermediate is probably 
an a/p heterodimer (I in Figure 4). The heterodimers 
assemble further into half proteasomes built from a sub- 
units and p-subunit precursors (II in Figure 4). Previously 
described 1 3S-1 6S precursor complexes of mammalian 
proteasomes may represent the eukaryotic counter- 
parts of these intermediates (e.g., see Yang et al.. 1995; 
Schmidtke et al., 1997). Half proteasomes remain inac- 
tive even when the propeptide is absent due to genetic 
deletion. Assembly proceeds via dimerization of half 
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proteasomes and is completed by the final conversion 
of preholoproteasomes into holoproteasomes (111 and IV 
in Figure 4). The last step involves the cleavage of the 
propeptide, which is the rate-limiting step in the assem- 
bly pathway. Deletion of the ^-propeptide retards but 
does not completely prevent proteasome assembly; ad- 
dition of the propeptide in trans can compensate for the 
deletion. In that case, formation of holoproteasomes 
from half proteasomes is accelerated, because the 
cleavage reaction becomes unnecessary. These obser- 
vations indicate that the role of the propeptide is two- 
fold: It supports initial folding of the 0 subunits. thus 
acting as an intramolecular chaperone, and it promotes 
the maturation of holoproteasomes following the dock- 
ing of two half-proteasomes. 

The Proteasome and Its Regulatory Complexes 
The 19$ Cap Complex Renders 
Proteolysis Energy-Dependent 
In electron micrographs, the W 26S" proteasome (a more 
accurate value is 30.3S; Yoshimura, et al. 1993) appears 
as an elongated structure (~45 nm long) consisting of 
a central 20S complex capped at either one or both 
ends by the 19S complexes. These 19S caps, which 
have a molecular mass of approximately 700 kDa. serve 
to recognize ubiquitylated proteins and to convert them 
into a form competent for degradation by the 20S core 
complex. When bound to the 20S complex, the two 19S 
caps face in opposite directions, reflecting the underly- 
ing C2 symmetry of the 20S proteasome (Peters et at., 
1993). The coexistence of symmetric and asymmetric 
26S complexes is reminiscent of the situation with 
GroEL-GroES chaperonin complexes, where there has 
been some controversy regarding the functional signifi- 
cance of the two species (Schmidt et al., 1994; Engel 
et al., 1995). Similar to the situation with GroEL-GroES, 
we do not see any compelling reason why a symmetric 
complex should be obligatory; in fact, an asymmetric 
complex seems to be better suited for performing a 
vectorial process involving the uptake of substrate and 
the release of product. 

The 19S caps are flexibly linked to the 20S core com- 
plex. As illustrated in Figure 5a, they undergo a "wag- 
ging" type of movement; in symmetrical complexes, the 
movement of the two 19S caps appears to be noncorre- 
lated (Walz et al., 1998). We are not yet in a position to 
say whether such a movement is functionally relevant 
and, perhaps, dependent on ATP-hydrolysis or whether 
it merely reflects some internal flexibility. It is noteworthy 
that very similar characteristics and amplitudes of move- 
ment are observed in 26S proteasome preparations iso- 
lated from different sources and in different types of EM 
preparations. In any case, the lability of the complex 
and the existence of such a flexible linkage make the 
structural analysis of the 26S proteasome a cumber- 
some task. A low-resolution 3D map has recently been 
obtained following segmentation of the complex on the 
level of image analysis; a composite model combining 
the separately determined structures of the 19S caps 
and the 20S core is shown in Figure 5b. The model 
provides a first insight into the remarkable complexity of 
the 19S structure. Although it may be of little immediate 




Figure 5. The 26S Proteasome 

(a) Two-dimensional average of the 26S proteasome from Drosoph- 
ila mefanogaster obtained from electron microscopic images of neg- 
atively stained specimens. Image analysis of a large dataset of parti- 
cles revealed the flexible linkage ofthe19Scapstothe 20S core. The 
figure shows two states of the complex at the maximum amplitude of 
the wagging movement superimposed, one in color-scale, the other 
in gray-scale. The direction of the motion is indicated by the white 
arrows. From statistical analysis, it can be deduced that the motions 
of the left and right 19S caps are not correlated. The scale bar is 
20 nm. 

(b) The 3D structure of negatively stained 19S caps was calculated 
using the method of random conical tilting. The figure shows a 
composite model of the electron microscopic structure of the 19S 
caps combined with the low-pass filtered 20S proteasome. 

usefulness, it will serve as a platform for an accurate 
mapping of the approximately 20 different subunits and 
for monitoring, by time-resolved EM, the fate of sub- 
strates "en route" through this labyrinth of subunits. 

Much progress has been made in recent years in de- 
fining the components of the 19S complex, which is 
simitar if not identical to PA700 (a proteasome activator 
of 700 kDa (for review, see Peters, 1994]). Therefore, in 
compiling a listing of its component subunits (Table 1), 
we have not discriminated between the two complexes. 
Currently 15 different subunits (not including homologs) 
have been defined through sequencing; at least 3 more 
subunits have been identified by SDS-gel electrophore- 
sis only. Six of the subunits are ATPases and members 
of the AAA-ATPase family (ATPases associated with a 
variety of cellular activities); the proteasomal ATPases 
form a distinct branch within this superfamily. A hallmark 
of the 19S cap ATPases, (with one possible exception, 
S4), is a predicted coiled coil segment near the N termi- 
nus (Lupas et al., 1993). Six genes encoding ATPases, 
which are members of this subfamily, are found in the 
yeast genome. In archaea, close relatives of the 19S 
cap ATPases have been discovered in the genomes of 
Methanococcus jannaschii and Archeoglobus fulgidus. 
In actinomycetes, a more distantly related ATPase of 
the AAA family has been found in the vicinity of the 
proteasome genes; when expressed in E. coli, it forms 
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Table 1. Subunits of the 19S Complex (Human and Yeast) 



Subunit* 


Name 6 


Accession' 


Structural Properties/Sequence Similarities 


Phenotypes of Mutants 


SI 


p112 


D44466 


WrVJi icon (o rv c r\ w 1 1 luui^i r>u i it io i n Y iaj *j£ 


mutation causes defects in ubiquitin-dependent 


(1 1U 


oeno 


L06321 


oikj dUuunio or uiu t\Jo cywKJb>ornc ui 




KU3) 






anapriase- promoting ecu i ipiex/Mrv. 


nmtty'm trancrwvt* mnw inforart w/ith Mini |Q1 A\* 








\Lupas ana csaumeisicr, lyy/j 


Crtri^ InforaHc urith tDKIA crti«f~irwi f»<~trir C/ml 
Oct lo niiciati) wiui irviXM sjjiH-h ty idLim jcii i 










|L/etVlol 1111 ei <JI., I iJjZl, I UKUld til al., 1 


CO 


nQ-l (CC ft 

py / (ob. II, 


Uro ID I 


contains KEKE motifs; similarity to SI and 


33. Mil rvHr -£/ UlilUS Itl I Wr lecepiOl Ij JJt; 1 


/inn 


toad 9 nRT\ 




subunits of the 20S eye lo so mo or 


ou RJ may ue invuiveu hi ntuu^eu ceii ueoui iduiuui 


kOa) 


Nasi 


i iimoa 


anaphase-promoting complex/ APC (Lupas 


ei ai., oias i mutants accumulate 








and Baumeister, 1997) 


uotquityiateu proteins (isurumi et ai., i99oj 


S3 


p58 


UUDJ.b/Ozb 




Sun2 may function in cell cycle similar to Ninl 


(61 kDa) 


Sun2 


111 8778 




(S14); mutants accumulate ubiquity tated proteins 










(Komtnamt et al.. 1997) 


S4 


p56 


L02426 


member of AAA ATPase superf amity 




(52 kDa) 


Yta5 


X81070 


(Confalonien and Duguet 1995); binds to S7 










in vitro (Richmond et al., 1997) 




S5a 


S5a (AF-1) 


U 51 007 


contains KEKE motifs; similarity to human 


although this subunit shows in vitro high affinity 


(50 kDa) 


Mcbl (Sunl) 


D78022 


BTF2/p44 and homokxrs (bask: 


to pofyubiquitin (Deveraux et aL, 1997; Kominami 








transcription factors) 


et al.. 1997), it seems not to be essential for 










ubiquitin-dependent proteolysis (van Nocker 










et al., 1996); S5a restores DNA binding capacity 










of HLH proteins MyoD and El 2 by interacting 










with Id! (Anand et al., 1997) 


S5b 


S5b 


S79862 


contains 9 diteucine repeats 


— 


(50 kDa) 


no yeast homo log 








S6 


p48 fTbp7) 


Sp: P43686 


member of AAA ATPase superfamily; binds 


Tbp7 was identified as H IV -Tat binding protein 


(48 kDa) 


Yta2 


U 06229 


to S8 in vitro (Richmond et al.. 1997) 


(Dubiel et al., 1994) 


S6' 


p50 (Tbpl) 


M34079 


member of AAA ATPase superfamily; binds 


Tbp1 (HIV-Tat binding protein) is a putative 


(48 kDa) 


Ytal 


Z78025 


to SI 0b in vitro (Richmond et at., 1997) 


transcriptional activator (Ohana et al., 1993) 


S7 


Mssl 


D11094 


member of AAA ATPase superfamily; binds 


overexpressed Mssl enhances Tat -mediated 


(47 kDa) 


Cim5 (Yta3) 


Z22817 


to S4 in vitro (Richmond et al.. 1997) 


transact'rvation of HIV genes (Shibuya et al.. 










1992); cim5 mutation stops cell division and 










causes accumulation of B-typc cyclins (Ghistain 










et al.. 1993) 


S8 


p45 (Tripl) 


L38810 


member of AAA ATPase superfamily; binds 


Sugl is a putative transcriptional mediator (Lee 


(46 kDa) 


Sugl (Cim3) 


X 66400 


to S6 in vitro (Richmond et al., 1997) 


et al., 1995; Xu et al., 1995; Rubin et ai. t 1996; 








Wang et al., 1996, Fraser et ai., 1997; Weeoa et 










al., 1997); Cim3 mutation stops cell division and 










causes accumulation of B-type cyclins (Ghislain 










et ai., lyyjj 


S9 


S9 


AF001212 


contains 9 dileucine repeats 




(46 kDa) 


Off 


X95644 






SlOa 


S10a (Orf07) 


D14663 


similarity to Arabidopsis fus6 and homologs 


Fusca proteins are critical for plant development 


(44 kDa) 


Off 


U32445 


in human and rat (Fusca proteins) 


(Castle and Meinke, 1994) 


Si Ob 


p42 


D78275 


member of AAA ATPase superfamily; binds 


Sug2 was identified as suppressor of gai4 


(44 kDa) 


Sug2 (Pcsl) 


U93262 


to S6' in vitro (Richmond et al.. 1997) 


(Russell et al. 1996); mutants fail to duplicate the 










spindle pole body (McDonald and Byers, 1997) 


S11 


unknown 








(43 kDa) 










S12 


p40 (S12) 


D5O063 


contains KEKE motifs 




(36 kDa) 


Nas3 (YOR261C) 


Z75169 






S13 


unknown 








(32 kDa) 










S14 


p31 


D38047 




Ninl is required for activation of cdc28 kinase; 


(30 kDa) 


Ninl 


D10515 




mutation stops cell cycle (Gordon et at., 1996; 










Kominami et al.. 1997) 


S15 


unknown 








(25 kDa) 











■ Subunits are numbered according to their SDS-PAGE mobility (Dubiel et al., 1995). 

b First line: name of the human subunit; second line: name of the yeast subunit 

c Genbank accession; exceptions: DDBJ. DNA database of Japan; Sp. Swiss-protein. 

d SDS-PAGE apparent molecular mass. ± 3 kDa depending upon the gel system (Dubiel et al., 1995). 



a ring-shaped complex with ATPase activity (Wolf et al., 
1998). This suggests that the proteasomal ATPases form 
six-membered homomeric rings in archaea and bacte- 
ria, and six-membered heteromeric rings in eukarya. 
From the structure of simpler ATP-dependent prote- 
ases, such as HsIVU and CIpAP, it has been inferred that 



in the 19S cap complex, they also form a ring docking 
directly onto the a rings and surrounding the orifice of 
the 20S channel. Although plausible, rigorous experi- 
mental proof for such a topology is currently lacking. 
Also, the precise role of the ATPases, that is, the energy- 
dependent step in the degradation of proteins, awaits 
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clarification. It may include the recognition of target pro- 
teins, their dissociation and unfolding, their transloca- 
tion into the 20S inner cavities, and/or the gating of 
the channel Dissociation and unfolding are particularly 
likely steps of ATP involvement; however, an unfoldase 
activity has yet to be demonstrated. 

19S cap complexes have only been found in eukary- 
otic cells, indicating that the non-ATPase subunitsof the 
complex have been added to the proteasome-ATPase 
complexes late in evolution. Several of them are required 
to link the proteasome to the ubiquitin system (ubiquitin- 
binding, deubiqurtylation), which through its array of li- 
gating enzymes (for review, see Hochstrasser, 1996) 
confers specificity to the proteasome. This, however, 
does not imply that the proteasome acts exclusively in 
the context of the ubiquitin pathway. The functions of 
most of the non-ATPase subunits of the 19S cap com- 
plex are still unknown (Table 1). The large variety of 
phenotypes associated with disruption of their genes is 
a reflection of the involvement of the proteasome in a 
large number of cellular processes but gives only vague 
hints as to their biochemical function. 
TTie 11S Complex: An A TP-independent Activator 
The regulator, variously referred to as 11 S or PA28, was 
originally identified as an activator of the peptidase ac- 
tivities of the 20S proteasome from mammalian cells 
(Chu-Ping et a!., 1992; Dubiel et at., 1992). It has no 
hydrolytic activity of its own, but when combined with 
20S proteasomes it accelerates the degradation of fluo- 
rogenic peptides in an ATP-independent manner. How- 
ever, it does not stimulate the degradation of proteins 
or of ubiquctin-protein conjugates. Since free peptides 
are short-lived and therefore rare in cells, the in vivo 
relevance of this complex is not obvious. Furthermore, 
its occurrence seems to be restricted to higher eukary- 
otes— no homolog exists in yeast— indicating that it has 
a specific rather than a basic function in the proteasomal 
degradation pathway. Most likely, these functions are 
related to the proteasome's involvement in antigen 
processing. The first evidence for a role in this context 
came from experiments demonstrating that the synthe- 
sis of 11S/PA28 is strongly induced by the cytokine 
7-interferon (Realini et al., 1994). Similarly, ^-interferon 
induces synthesis of three 0-type subunits of the 20S 
proteasome, LMP2. LMP7, and MECL-1, which replace 
their constitutive counterparts (see Figure 2) in the 
"immunoproteasome." Meanwhile, further evidence has 
accumulated supporting a role of 11S/PA28 in antigen 
processing. The cleavage of fiuorogenic peptides by 
LMP2/7 containing proteasomes is enhanced in its pres- 
ence (Ustrell et al., 1995), and the repertoire of peptides 
generated from a polypeptide becomes larger (Groet- 
trup et al., 1995). Using natural peptides, it was shown 
that the presence of PA28 changes the single-cleavage 
mode of 20S proteasomes to a coordinated double- 
cleavage mechanism, thereby optimizing the generation 
of dominant T-cell epitopes (Dick et al., 1996b). Finally, 
the observation that overexpression of PA28 to a level 
similar to that obtained after ^-interferon treatment 
markedly enhanced the efficiency of viral antigen pro- 
cessing establishes the role of PA28 in vivo (Groettrup 
et al., 1996). Recently it was proposed, based on ex- 
periments with the HIV-1 Tat protein, that viruses may 




Figure 6. The 11S/PA28 Regulator 

(a) Two-dimensional average of the PA28-20S complex from the 
rat 

(b) Composite surface model of the PA28 regulatory complex de- 
rived from the atomic coordinates and the 20S proteasome. The 
scale bar is 20 nm. 

(c) Ribbon model of the atomic structure of the recombinant human 
PA28 complex as determined by X-ray crystallography (Knowlton 
et at., 1997). Side view (left) and top view (right). The scale bar is 
10 nm. 

escape from immune surveillance by preventing the for- 
mation of 20S/11S complexes (Seeger et al., 1997). Al- 
though the role of PA28 in antigen processing appears 
to now be well established, the mechanism by which 
PA28 modulates the activity of the 20S proteasomes 
remains enigmatic. 

Structurally, PA28 forms multimeric ring-shaped com- 
plexes (Gray et al., 1994) that bind to the two a rings of 
the 20S proteasome (Figure 6a). It is built from two types 
of subunits of approximately 30 kDa, PAa and PAp, 
which are about 50% identical in sequence. Biochemical 
data suggest that PAa and PAp form heteromeric rings 
containing the two subunits in a 1 :1 stoichiometry (Ann 
et al., 1996; Kuehn and Dahlmann, 1996; Song et al., 

1996) ; this would argue for an even number of subunits, 
most likely six, in the complex. Recombinant PAa as- 
sembles into seven-membered rings (Knowlton et al., 

1997) , which are able to stimulate 20S proteasomes 
similarly to the native PAo/p (Realini et al., 1994; Song 
et al., 1996). PA0 alone does not form multimers nor 
does it exhibit stimulatory activity (Kuehn and Dahl- 
mann, 1996). This indicates that PAa mediates associa- 
tion within the 11S complex and between this complex 
and the 20S proteasome; a short carboxyl -terminal se- 
quence of PAa was found to be critical for this role 
(Song et al., 1996). Interestingly, the two PA28 subunits 
have sequence similarity (about 35% identity) to a third 
protein, the nuclear Ki antigen found in systemic lupus 
erythematosus patients. The function of this protein, 
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Figure 7. Synopsis of Structures of Self-Compartmentalizing Proteases Represented as Surface Models with Equal Scaling 

(a) 20S proteasome of Thermopiasma acidopNIum (Lowe et at, 1995). Side view (left), top view (center), and cut-open side view with the 
proteolytic sites marked in red (right). 

(b) CIpP protease from E co/r (Wang et al., 1997b). 

(c) Tricom protease from Thermopiasma acidophUum (Walz et al.. 1997). On the left a single hexameric tricom molecule is shown; it was 
token from the 30 reconstruction of the icosahedral tricom capstd (right). 

(d) Gal6/bleomycin hydrolase from yeast (Joshua-Tor et al., 1 995). 

The structures in (a) and (b) were low-pass filtered to 1.2 nm, and the structures in (c) and (d) to 2.5 rtm. The scale bar is 20 nm. 



which is highly conserved among species, is still un- 
known. Its relationship to the subunrts of PA28 and the 
fact that it is upregulated by -y-interferon, points to a 
possible role in immunity. However, unlike PAa and p\ 
the occurrence of the Ki antigen is not restricted to 
higher vertebrates (Paesen and Nuttall, 1996). 

Recently, the crystal structure of a heptameric com- 
plex of PA28a has been solved (Figure 6c). The PA28a 
monomer is almost exclusively a-helical; three of its four 
long helices form a tightly packed cone, the fourth helix 
(H1) contains a sharp kink allowing it to interconnect to 
the neighboring monomer. The complex is traversed by 
a channel of 2 nm in diameter at its apex and 3 nm at 
the base, which docks to the proteasome (Figure 6b). 
It is suggested that the binding of PA28 causes a confor- 
mational change that opens the a-ring channel of the 
proteasome (Knowlton et al., 1997). However, this alone 
can hardly explain all the aforementioned modulatory 



effects of this regulator. In fact, it may have more far- 
reaching allosteric effects, perhaps even transmitted to 
the active sites in the central cavity. We observed that 
the binding of PA28 weakens the interaction between 
rings in the 20S proteasome to the extent of partial 
disassembly (W. B., unpublished data); this may facili- 
tate the efflux of degradation products and, as part of 
a functional cycle, may prepare the proteasome for an- 
other round of degradation (Lupas et al., 1995). 

Other Self-Compartmentalizing Proteases 
Self compartmentalization has begun to emerge as a 
principle common to several proteases. The recently 
determined crystal structure of CIpP (Wang et al., 1 997b) 
revealed a striking example of evolutionary conver- 
gence. In spite of being unrelated in sequence and fold, 
it forms a homooligomer built of two heptameric rings. 
This encloses a cavity with dimensions very close to 
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those of the central cavity of the 20S proteasome, which 
is accessible through an axial channel approximately 
1 .2 nm in diameter (Figure 7). Even the disposition of 
active sites, here formed by a classical Ser-His-Asp cat- 
alytic triad, and the distance between them (2.6 nm) 
almost matches the geometry inside the proteasome. 
Also, Clp is synthesized as a precursor, which is pro- 
cessed autocatalytically to its mature form. Like the 20S 
proteasome, CIpP associates with an ATPase. ClpA, 
which docks to both ends where the entry-exit channel 
is located (Kessel et al., 1995). 

Recently, two energy-independent large protease 
complexes have been described, which are traversed 
by a channel that widens into a large central cavity: 
bleomycin hydrolase/Gal6 (Joshua-Tor et at., 1995) and 
the tricorn protease (Tamura et al., 1996). Bleomycin 
hydrolase is a papain-type cysteine protease; the active 
site of the tricorn protease is not yet fully defined. The 
720 kDa tricorn hexamer, which has trypsin-like pepti- 
dase activity, assembles further in vivo to form an icosa- 
hedral capsid composed of 20 hexamers. This 15 MDa 
complex is likely to act as an organizing center, or scaf- 
fold, for several aminopeptidases. Interaction with the 
tricorn core complex enhances, possibly by an allosteric 
mechanism, intrinsic activities of these peptidases and 
elicits new ones found neither in the aminopeptidases 
nor in the tricorn alone. 

Conclusions and Outlook 

In recent years, it has become clear that the proteasome 
is a highly conserved proteolytic complex common to 
all three domains of life. In eukaryotic cells, the 20S 
proteasome is the core complex of an energy-depen- 
dent protein degradation machinery that equals the pro- 
tein synthesis machinery in its complexity. While the 
ubiquitin-proteasome pathway of protein breakdown is 
essential for eukaryotes, prokaryotes possess redun- 
dant proteolytic systems. The principle of self-compart- 
mentalization is common to many of them. In view of 
this redundancy, it is not surprising that proteasomes 
are not essential in archaea and bacteria under normal 
(i.e., nonstress) growth conditions. At present, the biol- 
ogy of proteolysis is not well understood for prokary- 
otes; it is largely unknown what the natural substrates 
are, how these substrates are targeted, and to what 
extent the substrate spectra of the various energy- 
dependent proteolytic systems might overlap. 

Prokaryotic proteasomes have, by virtue of their rela- 
tive simplicity, taken a pivotal role in elucidating the 
structure and enzymatic mechanism of the 20S com- 
plex. In spite of these advances, several important is- 
sues remain unresolved. We do not yet know how, pre- 
cisely, the polypeptide chain is translocated into the 
barrel-shaped complex, nor do we know how the degra- 
dation products are discharged. Is some kind of ratch- 
eting mechanism involved in directing the substrates to 
the proteolytic nanocompartment? If so, how does it 
work in molecular terms? Another major challenge is to 
achieve a better understanding of the principles that 
govern the ordered assembly of the 14 distinct but very 
similar subunits. The finding in higher eukaryotes that 
inducible subunits can replace constitutive subunits fur- 
ther adds to the complexity of the assembly pathway. 



In eukaryotic cells, the physiologically relevant form 
of the proteasome is the 26S complex. The "parts list" 
of the regulatory 19S complex may be near completion 
now, but the precise function of the vast majority of 
these parts remains enigmatic, and hitherto no blueprint 
exists describing how the parts are assembled. It poses 
a considerable challenge to structural biologists to es- 
tablish the structure and topology of the 19S subunits, 
particularly in view of the flexibility and variability re- 
vealed by initial studies. The ultimate goal must be to 
achieve a precise description of the sequence of events 
from the recognition of a ubiquitylated substrate protein 
to its translocation into the 20S proteolytic core. Like 
the ribosome and thespliceosome, the 26S proteasome 
is known to be an energy-dependent machinery. How- 
ever, the popular but unproven hypothesis that ATP 
hydrolysis is used to fuel the disassembly and unfolding 
of substrate proteins awaits experimental verification. 
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